Pell's equation

Pell's equation is any Diophantine equation of the form

x^2-ny^2=1\,

where n is a nonsquare integer. The word Diophantine means that integer values of x and y are sought. Trivially, x = 1 and y = 0 always solve this equation. Lagrange proved that for any natural number n that is not a perfect square there are x and y > 0 that satisfy Pell's equation. Moreover, infinitely many such solutions of this equation exist. These solutions yield good rational approximations of the form x/y  to the square root of n.

The name of this equation arose from Leonhard Euler's mistakenly attributing its study to John Pell. Euler was aware of the work of Lord Brouncker, the first European mathematician to find a general solution of the equation, but apparently confused Brouncker with Pell. This equation was first studied extensively in ancient India, starting with Brahmagupta, who developed the chakravala method to solve Pell's equation and other quadratic indeterminate equations in his Brahma Sphuta Siddhanta in 628, about a thousand years before Pell's time. His Brahma Sphuta Siddhanta was translated into Arabic in 773 and was subsequently translated into Latin in 1126. Bhaskara II in the 12th century and Narayana Pandit in the 14th century both found general solutions to Pell's equation and other quadratic indeterminate equations. Solutions to specific examples of the Pell equation, such as the Pell numbers arising from the equation with n = 2, had been known for much longer, since the time of Pythagoras in Greece and to a similar date in India.

For a more detailed discussion of much of the material here, see Lenstra (2002) and Barbeau (2003).

Contents

History

Pell's equations were studied as early as 400 BC in India and Greece. They were mainly interested in the equation

 x^2 - 2y^2=1 \,

because of its connection to the square root of two. Indeed, if x and y are integers satisfying this equation, then x / y is an approximation of √2. For example, Baudhayana discovered that x = 17, y = 12 and x = 577, y = 408 are two solutions to the Pell equation, and gave very close approximations to the square root of two.

Later, Archimedes used a similar equation to approximate the square root of 3, and found 1351/780.

Around AD 250, Diophantus created a different form of the Pell equation

 a^2 x^2%2Bc=y^2. \,

He solved this equation for a = 1, and c = −1, 1, and 12, and also solved for a = 3 and c = 9.

Brahmagupta created a general way to solve Pell's equation known as the chakravala method. Bhāskara I created a way to create new solutions to Pell equations from one solution. E. Strachey published the work of Bhāskara I in English in 1813. Alkarkhi worked on similar problems to Diophantus.

Brahmagupta discovered that (see Brahmagupta's identity)

(x_1^2 - Ny_1^2)(x_2^2 - Ny_2^2) = (x_1x_2 %2B Ny_1y_2)^2 - N(x_1y_2 %2B x_2y_1)^2.

Using this, he was able to "compose" triples (x_1, y_1, k_1) and (x_2, y_2, k_2) that were solutions of x^2 - Ny^2 = k, to generate the new triple

(x_1x_2 %2B Ny_1y_2 \,,\, x_1y_2 %2B x_2y_1 \,,\, k_1k_2).

Not only did this give a way to generate infinitely many solutions to x^2 - Ny^2 = 1 starting with one solution, but also, by dividing such a composition by k_1k_2, integer or "nearly integer" solutions could often be obtained. For instance, for N=92, Brahmagupta composed the triple (10, 1, 8) (since 10^2 - 92(1^2) = 8) with itself to get the new triple (192, 20, 64). Dividing throughout by 64 gave the triple (24, 5/2, 1), which when composed with itself gave the desired integer solution (1151, 120, 1). Brahmagupta solved many Pell equations with this method; in particular he showed how to obtain solutions starting from an integer solution of x^2 - Ny^2 = k for k=±1, ±2, or ±4.[1]

The first general method for solving the Pell equation (for all N) was given by Bhaskara II in 1150, extending the methods of Brahmagupta. Called the chakravala (cyclic) method, it starts by composing any triple (a,b,k) (that is, one which satisfies a^2 - Nb^2 = k) with the trivial triple (m, 1, m^2 - N) to get the triple (am %2B Nb, a%2Bbm, k(m^2-N)), which can be scaled down to

\left( \frac{am%2BNb}{k} \,,\, \frac{a%2Bbm}{k} \,,\, \frac{m^2-N}{k} \right).

When m is chosen so that (a+bm)/k is an integer, so are the other two numbers in the triple. Among such m, the method chooses one that minimizes (m²-N)/k, and repeats the process. This method always terminates with a solution (proved by Lagrange in 1768). Bhaskara used it to give the solution x=1766319049, y=226153980 to the notorious N=61 case.[1]

The general theory of Pell's equation, based on continued fractions and algebraic manipulations with numbers of the form P%2BQ\sqrt{a}, was developed by Lagrange in 1766–1769.[2]

Solutions

Fundamental solution via continued fractions

Let \tfrac{h_i}{k_i} denote the sequence of convergents to the continued fraction for \scriptstyle\sqrt{n}. Then the pair (x1,y1) solving Pell's equation and minimizing x satisfies x1 = hi and y1 = ki for some i. This pair is called the fundamental solution. Thus, the fundamental solution may be found by performing the continued fraction expansion and testing each successive convergent until a solution to Pell's equation is found.

As Lenstra (2002) describes, the time for finding the fundamental solution using the continued fraction method, with the aid of the Schönhage–Strassen algorithm for fast integer multiplication, is within a logarithmic factor of the solution size, the number of digits in the pair (x1,y1). However, this is not a polynomial time algorithm because the number of digits in the solution may be as large as √n, far larger than a polynomial in the number of digits in the input value n (Lenstra 2002).

Additional solutions from the fundamental solution

Once the fundamental solution is found, all remaining solutions may be calculated algebraically as

x_k %2B y_k\sqrt n = (x_1 %2B y_1\sqrt n)^k.

Equivalently, we may calculate subsequent solutions via the recurrence relations

\displaystyle x_{k%2B1} = x_1 x_k %2B n y_1 y_k,
\displaystyle y_{k%2B1} = x_1 y_k %2B y_1 x_k.

An alternative method to solving, once finding the first non-trivial solution, one could take the original equation x^2 - ny^2 = 1 and factor the left hand side as a difference of squares, yielding (x %2B y\sqrt n)(x - y\sqrt n) = 1. Once in this form, one can simply raise each side of the equation to the kth power, and recombining the factored form to a single difference statement. The solution s will be of the form (x-s)^k %2B n*(y-s)^k = 1.

Concise representation and faster algorithms

Although writing out the fundamental solution (x1,y1) as a pair of binary numbers may require a large number of bits, it may in many cases be represented more compactly in the form

x_1%2By_1\sqrt n = \prod_{i=1}^t (a_i %2B b_i\sqrt n)^{c_i}

using much smaller coefficients ai, bi, and ci.

For instance, Archimedes' cattle problem may be solved using a Pell equation, the fundamental solution of which has 206545 digits if written out explicitly. However, instead of writing the solution as a pair of numbers, it may be written using the formula

x_1%2By_1\sqrt n=u^{2329},

where

u = (x'_1%2By'_1\sqrt{4729494})

and \scriptstyle x'_1 and \scriptstyle y'_1 only have 45 and 41 decimal digits, respectively. Alternatively, one may write even more concisely

u = (300426607914281713365\sqrt{609}%2B84129507677858393258\sqrt{7766})^2.

(Lenstra 2002).

Methods related to the quadratic sieve approach for integer factorization may be used to collect relations between prime numbers in the number field generated by √n, and to combine these relations to find a product representation of this type. The resulting algorithm for solving Pell's equation is more efficient than the continued fraction method, though it still does not take polynomial time. Under the assumption of the generalized Riemann hypothesis, it can be shown to take time

\exp O(\sqrt{\log N\log\log N}),

where N = log n is the input size, similarly to the quadratic sieve (Lenstra 2002).

Quantum algorithms

Hallgren (2007) showed that a quantum computer can find a product representation, as described above, for the solution to Pell's equation in polynomial time. Hallgren's algorithm, which can be interpreted as an algorithm for finding the group of units of a real quadratic number field, was extended to more general fields by Schmidt & Völlmer (2005).

Example

As an example, consider the instance of Pell's equation for n = 7; that is,

\displaystyle x^2 - 7 y^2 = 1.

The sequence of convergents for the square root of seven are

h / k (Convergent) h2 −7k2 (Pell-type approximation)
2 / 1 −3
3 / 1 +2
5 / 2 −3
8 / 3 +1

Therefore, the fundamental solution is formed by the pair (8, 3). Applying the recurrence formula to this solution generates the infinite sequence of solutions

(8, 3); (127, 48); (2024, 765); (32257, 12192); (514088, 194307); (8193151; 3096720); (130576328, 49353213); ...

Connections

Pell's equation has connections to several other important subjects in mathematics.

Algebraic number theory

Pell's equation is closely related to the theory of algebraic numbers, as the formula

x^2 - n y^2 = (x %2B y\sqrt n)(x - y\sqrt n)

is the norm for the ring \mathbb{Z}[\sqrt{n}] and for the closely related quadratic field \mathbb{Q}[\sqrt{n}]. Thus, a pair of integers (x, y) solves Pell's equation if and only if x %2B y \sqrt{n} is a unit with norm 1 in \mathbb{Z}[\sqrt{n}]. Dirichlet's unit theorem, that all units of \mathbb{Z}[\sqrt{n}] can be expressed as powers of a single fundamental unit (and multiplication by a sign), is an algebraic restatement of the fact that all solutions to the Pell equation can be generated from the fundamental solution. The fundamental unit can in general be found by solving a Pell-like equation but it does not always correspond directly to the fundamental solution of Pell's equation itself.

Chebyshev polynomials

Demeyer (2007) mentions a connection between Pell's equation and the Chebyshev polynomials: If Ti (x) and Ui (x) are the Chebyshev polynomials of the first and second kind, respectively, then these polynomials satisfy a form of Pell's equation in any polynomial ring R[x], with n = x2 − 1:

T_i^2 - (x^2-1) U_{i-1}^2 = 1. \,

Thus, these polynomials can be generated by the standard technique for Pell equations of taking powers of a fundamental solution:

T_i %2B U_{i-1} \sqrt{x^2-1} = (x %2B \sqrt{x^2-1})^i. \,

It may further be observed that, if (xi,yi) are the solutions to any integer Pell equation, then xi = Ti (x1) and yi = y1Ui − 1(x1) (Barbeau, chapter 3).

Continued fractions

A general development of solutions of Pell's equation in terms of continued fractions can be presented, as the solutions x and y are approximates to the square root of n and thus are a special case of continued fraction approximations for quadratic irrationals.

The relationship to the continued fractions implies that the solutions to Pell's equation form a semigroup subset of the modular group. Thus, for example, if p and q satisfy Pell's equation, then

\begin{pmatrix} p & q \\ nq & p \end{pmatrix}

is a matrix of unit determinant. Products of such matrices take exactly the same form, and thus all such products yield solutions to Pell's equation. This can be understood in part to arise from the fact that successive convergents of a continued fraction share the same property: If pk−1/qk−1 and pk/qk are two successive convergents of a continued fraction, then the matrix

\begin{pmatrix} p_{k-1} & p_{k} \\ q_{k-1} & q_{k} \end{pmatrix}

has determinant (−1)k.

Størmer's theorem applies Pell equations to find pairs of consecutive smooth numbers. As part of this theory, Størmer also investigated divisibility relations among solutions to Pell's equation; in particular, he showed that each solution other than the fundamental solution has a prime factor that does not divide n.

As Lenstra (2002) describes, Pell's equation can also be used to solve Archimedes' cattle problem.

The negative Pell equation

The negative Pell equation is given by

 x^2 - ny^2 = -1 \, (eq.1)

It has also been extensively studied; it can be solved by the same method of using continued fractions and will have solutions when the period of the continued fraction has odd length. However we do not know which roots have odd period lengths so we do not know when the negative Pell equation is solvable. But we can eliminate certain n since a necessary but not sufficient condition for solvability is that n is not divisible by a prime of form 4m+3. Thus, for example, x2-3py2 = -1 is never solvable, but x2-5py2 = -1 may be, such as when p = 1 or 13, though not when p = 41.

Cremona & Odoni (1989) demonstrate that the proportion of square-free n for which the negative Pell equation is soluble is at least 40%. If it does have a solution, then it can be shown that its fundamental solution leads to the fundamental one for the positive case by squaring both sides of eq.1,

 (x^2 - ny^2)^2 = (-1)^2 \,

to get,

 (x^2 %2B ny^2)^2 - n(2xy)^2 = 1 \,

Or, since ny2 = x2+1 from eq.1, then,

 (2x^2 %2B 1)^2 - n(2xy)^2 = 1 \,

showing that fundamental solutions to the positive case are bigger than those for the negative case.

Transformations

I. The related equation,

 u^2 - dv^2 = \pm 2 \, (eq.2)

can be used to find solutions to the positive Pell equation for certain d. Legendre proved that all primes of form d = 4m + 3 solve one case of eq.2, with the form 8m + 3 solving the negative, and 8m + 7 for the positive. Their fundamental solution then leads to the one for x2dy2 = 1. This can be shown by squaring both sides of eq. 2,

 (u^2 - dv^2)^2 = (\pm 2)^2 \,

to get,

 (u^2 %2B dv^2)^2 - d(2uv)^2 = 4 \,

Since  dv^2 = u^2 \mp 2 from eq.2, then,

 (2u^2 \mp 2)^2 - d(2uv)^2 = 4 \,

or simply,

 (u^2 \mp 1)^2 - d(uv)^2 = 1 \,

showing that fundamental solutions to eq.2 are smaller than eq.1. For example, u2-3v2 = -2 is {u,v} = {1,1}, so x2 − 3y2 = 1 has {x,y} = {2,1}. On the other hand, u2 − 7v2 = 2 is {u,v} = {3,1}, so x2 − 7y2 = 1 has {x,y} = {8,3}.

II. Another related equation,

 u^2 - dv^2 = \pm 4 \, (eq.3)

can also be used to find solutions to Pell equations for certain d, this time for the positive and negative case. For the following transformations,[3] if fundamental {u,v} are both odd, then it leads to fundamental {x,y}.

1. If u2 − dv2 = −4, and {x,y} = {(u2 + 3)u/2, (u2 + 1)v/2}, then x2 − dy2 = −1.

Ex. Let d = 13, then {u,v} = {3, 1} and {x,y} = {18, 5}.

2. If u2 − dv2 = 4, and {x,y} = {(u2 − 3)u/2, (u2 − 1)v/2}, then x2 − dy2 = 1.

Ex. Let d = 13, then {u,v} = {11, 3} and {x,y} = {649, 180}.

3. If u2 − dv2 = −4, and {x,y} = {(u4 + 4u2 + 1)(u2 + 2)/2, (u2 + 3)(u2 + 1)uv/2}, then x2 − dy2 = 1.

Ex. Let d = 61, then {u,v} = {39, 5} and {x,y} = {1766319049, 226153980}.

Especially for the last transformation, it can be seen how solutions to {u,v} are much smaller than {x,y}, since the latter are sextic and quintic polynomials in terms of u.

Notes

  1. ^ a b John Stillwell (2002), Mathematics and its history (2 ed.), Springer, pp. 72–76, ISBN 9780387953366, http://books.google.com/?id=WNjRrqTm62QC&pg=PA72 
  2. ^ Solution d'un Probleme d'Arithmetique, in Oeuvres, t.1, 671–732
  3. ^ A Collection Of Identities: Pell Equations

References

External links